An optimization method for the inverted pendulum problem based on deep reinforcement learning

نویسندگان

چکیده

Abstract The inverted pendulum problem is a classical problem. starting at random position keeps moving upwards and aims to reach an upright position. has been solved through some methods based on deep reinforcement learning (DRL) such as Deep Deterministic Policy Gradient (DDPG). However, DDPG also disadvantages. policy not conducive action exploration. Moreover, the Q value needs be estimated reasonably accurately for accurate. Nevertheless, beginning of learning, there certain difference in estimation, parameters learned this time are easy deviate. Therefore, paper combining AdaBound with algorithm proposes optimization method problem, compares performance that four published baselines. experimental results show proposed outperforms above baselines extent.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Aggregation-Based Learning in the Inverted Pendulum Problem

We consider the problem of adapting approximate dynamic programming techniques to the inverted pendulum task. This is a particularly challenging task as we work with a relatively uninformative reinforcement signal and have no a priori information about our system. Success in this task requires an effective solution to the credit assignment problem, incorporation of noisy and biased information ...

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

Reinforcement Learning with Perturbation Method to Turn Unidirectional Linear Response Fuzzy Controller for Inverted Pendulum

In this paper, we present a unidirectional linear response fuzzy controller (FC) to control the inverted pendulum system. The performance of turning fuzzy controller is defined as an evaluation function and our proposed technique, which is based on the integration of reinforcement learning and a perturbation method, is utilized to diversity the search of minimization of the evaluation function....

متن کامل

Control of Inverted Double Pendulum using Reinforcement Learning

In this project, we apply reinforcement learning techniques to control an inverted double pendulum on a cart. We successfully learn a controller for balancing in a simulation environment using Qlearning with a linear function approximator, without any prior knowledge of the system at hand. We do however fail to learn a controller for the swingup maneuver, which leads to a discussion on what mig...

متن کامل

Fundamental Constraints for the Inverted Pendulum Problem

This paper considers fundamental constraints that exist in the control of an inverted pendulum system. The inverted pendulum system is a single input two output (SITO) system which has an unstable pole. The limitations that are developed use several diierent techniques to show the diiculties in the control of this system. The analysis has applications in the control of SITO systems that have un...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of physics

سال: 2022

ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']

DOI: https://doi.org/10.1088/1742-6596/2296/1/012008